Contribution of Data Categories to Readmission Prediction Accuracy
نویسندگان
چکیده
Identification of patients at high risk for readmission could help reduce morbidity and mortality as well as healthcare costs. Most of the existing studies on readmission prediction did not compare the contribution of data categories. In this study we analyzed relative contribution of 90,101 variables across 398,884 admission records corresponding to 163,468 patients, including patient demographics, historical hospitalization information, discharge disposition, diagnoses, procedures, medications and laboratory test results. We established an interpretable readmission prediction model based on Logistic Regression in scikit-learn, and added the available variables to the model one by one in order to analyze the influences of individual data categories on readmission prediction accuracy. Diagnosis related groups (c-statistic increment of 0.0933) and discharge disposition (cstatistic increment of 0.0269) were the strongest contributors to model accuracy. Additionally, we also identified the top ten contributing variables in every data category. Introduction Hospital readmission is an important metric of inpatient care quality and a major contributor to healthcare costs. Nearly 20% of hospitalized Medicare beneficiaries are readmitted to the hospital within 30 days of discharge. For example, in 2009, the Medicare Payment Advisory Commission (MedPAC) found that Medicare spends about $12 billion per year on preventable readmissions. In order to solve this problem, the Centers for Medicare and Medicaid Services (CMS) uses readmission rate as a criterion to penalize the hospitals whose readmission rate exceeds the expected threshold. In fiscal year 2017, more than half of the nation’s hospitals (2597 hospitals) will be penalized for excessive readmission rates. The average penalty is 0.71% of the hospital’s total Medicare reimbursement, and it can be as high as 3% depending on how far the rate of readmission exceeds the threshold. Additionally, readmission rate is also an important metric to evaluate the quality of healthcare. In the United States, readmission rate was selected as a significant indicator to measure the quality of healthcare in the National Quality Forum of 2008. In the United Kingdom, readmission rate within 28 days after discharge is used to measure the quality of healthcare. There are three categories of approaches to decreasing readmission rates: pre-discharge interventions, post-discharge interventions and bridging interventions. Pre-discharge interventions may include patient education, discharge planning, medication reconciliation and follow-up appointment scheduling before discharge. Post-discharge interventions may include close follow-up, timely PCP communication, follow-up telephone call, patient hotline and home visits. Bridging interventions may include transition coach, patient-centered discharge instructions and provider continuity. However, it is very expensive to apply these interventions to every patient. Thus, it is significant to identify the patients at high risk for readmission in order to make these interventions cost-effective. Electronic Medical Record (EMR) data provide abundant information to predict readmission risk. There are many published studies in this field. Hasan, Omar, et al. performed logistic regression analysis to identify significant predictors of unplanned readmission within 30 days of discharge and developed a scoring system for estimating readmission risk. Shadmi, Efrat, et al. developed a prediction score based on before admission electronic health record and administrative data using a preprocessing variable selection step with decision trees and neural network algorithms. Yu, Shipeng, et al. proposed a generic framework for institution-specific readmission risk prediction, which takes patient data from a single institution and produces a statistical risk prediction model optimized for that particular institution and, optionally, for a specific condition. Chen, Robert, et al. implemented a cloud-based predictive modeling system via a hybrid setup combining a secure private server with the Amazon Web Services Elastic Map-Reduce platform. Greenwald, Jeffrey L., et al. designed a 30-day readmission risk prediction model through identification of physical, cognitive, and psychosocial issues using natural language processing. However, these investigations provide little information on the comparative importance of different data categories on the accuracy of the readmission prediction models. We therefore conducted a study to analyze the contributions of different data categories to the discriminative ability of readmission prediction.
منابع مشابه
A study to reduce readmissions after surgery in the Veterans Health Administration: design and methodology
BACKGROUND Hospital readmissions are associated with higher resource utilization and worse patient outcomes. Causes of unplanned readmission to the hospital are multiple with some being better targets for intervention than others. To understand risk factors for surgical readmission and their incremental contribution to current Veterans Health Administration (VA) surgical quality assessment, the...
متن کاملJoint Impact of Clinical and Behavioral Variables on the Risk of Unplanned Readmission and Death after a Heart Failure Hospitalization
Most current methods for modeling rehospitalization events in heart failure patients make use of only clinical and medications data that is available in the electronic health records. However, information about patient-reported functional limitations, behavioral variables and socio-economic background of patients may also play an important role in predicting the risk of readmission in heart fai...
متن کاملLanguage Learning Strategy Use and Prediction of Foreign Language Proficiency Among Iranian EFL Learners
The purpose of this study was twofold. Firstly, it attempted to investigate whether language learning strategies (LLSs) can predict foreign language (FL) proficiency. Secondly, it examined what kind of LLSs Iranian learners of English use more frequently in FL institutes. To do so, 112 intermediate Iranian EFL learners participated in the study. Oxford’s Strategy Inventory for Language Learning...
متن کاملTidal prediction using time series analysis of Buoy observations
Although tidal observations which are extracted from coastal tide gages, have higher accuracy due to their higher sampling rate, installing these types of gages can impose some spatial limitation since we cannot use every part of sea to install them. To solve this limitation, we can employ satellite altimetry observations. However, satellite altimetry observations have lower sampling rate. Acco...
متن کاملA Survey on Factors Contributing to Hospital Patient Readmission
Background and Objectives: Patient readmission is an important criterion in performance evaluation of healthcare settings. Identifying the causes of the readmissions would help the policymakers in designing strategies for efficient management of health facilities. This stud aimed to explore factors contributing to readmission by sampling from an Iranian hospital. Methods: Clinical records of ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2018